A smoothing model for sample disclosure risk estimation

نویسندگان

  • Yosef Rinott
  • Natalie Shlomo
چکیده

When a sample frequency table is published, disclosure risk arises when some individuals can identified on the basis of their values in certain attributes in the table called key variables, and then their values in other attributes may be inferred, and their privacy is violated. On the basis of the sample to be released, and possibly some partial knowledge of the whole population, an agency which considers releasing the sample, has to estimate the disclosure risk. Risk arises from non-empty sample cells which represent small population cells and from population uniques in particular. Therefore risk estimation requires assessing how many of the relevant population cells are likely to be small. Various methods have been proposed for this task, and we present a method in which estimation of a population cell frequency is based on smoothing using a local neighborhood of this cell, that is, cells having similar or close values in

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Individual Disclosure Risk Measures Based on Log-Linear Models

Dissemination of microdata files should be constrained to the confidentiality pledge under which a statistical agency collects survey data. To protect the confidentiality of respondents, statistical agencies perform a two-stage statistical disclosure control procedure. In the first stage, with respect to a disclosure scenario, the risk of disclosure of each unit is estimated. After the removal ...

متن کامل

Two-step Smoothing Estimation of the Time-variant Parameter with Application to Temperature Data

‎In this article‎, ‎we develop two nonparametric smoothing estimators for parameter of a time-variant parametric model‎. ‎This parameter can be from any parametric family or from any parametric or semi-parametric regression model‎. ‎Estimation is based on a two-step procedure‎, ‎in which we first get the raw estimate of the parameter at a set of disjoint time...

متن کامل

Presenting a model for Multiple-step-ahead-Forecasting of volatility and Conditional Value at Risk in fossil energy markets

Fossil energy markets have always been known as strategic and important markets. They have a significant impact on the macro economy and financial markets of the world. The nature of these markets are accompanied by sudden shocks and volatility in the prices. Therefore, they must be controlled and forecasted by using appropriate tools. This paper adopts the Generalized Auto Regressive Condition...

متن کامل

WP. 10 ENGLISH ONLY UNITED NATIONS STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS EUROPEAN COMMISSION STATISTICAL OFFICE OF THE EUROPEAN COMMUNITIES (EUROSTAT)

The disclosure risk involved in releasing data which consist of a sample from some population depends on both the sample and the population. When the sample is fully known, with only partial or no information on the population, a major problem in Statistical Disclosure Control (SDC) is the estimation of disclosure risk on the basis of the sample. Considering data in the form of a frequency tabl...

متن کامل

A MODIFICATION ON RIDGE ESTIMATION FOR FUZZY NONPARAMETRIC REGRESSION

This paper deals with ridge estimation of fuzzy nonparametric regression models using triangular fuzzy numbers. This estimation method is obtained by implementing ridge regression learning algorithm in the La- grangian dual space. The distance measure for fuzzy numbers that suggested by Diamond is used and the local linear smoothing technique with the cross- validation procedure for selecting t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006